WHAT IS CLAIMED IS: 



1. A computer accessible medium encoded with instructions which, when executed: 

5 replicate a checkpoint segment from a first local storage of a first node to at least 

one other node, wherein the checkpoint segment is stored into the first 
local storage by an application, and wherein the checkpoint segment 
comprises a state of the application; and 

10 load a copy of the checkpoint segment from the at least one other node into a 

second local storage of a second node responsive to a request from the 
second node to load the copy, wherein the second node is to execute the 
application. 

15 2. The computer accessible medium as recited in claim 1 wherein the checkpoint 
segment is identified by a checkpoint segment name that is unique within a cluster 
including the first node and the second node. 

3. The computer accessible medium as recited in claim 1 wherein the at least one other 
20 node comprises two or more other nodes. 

4. The computer accessible medium as recited in claim 1 wherein the request to load the 
copy is issued by the application executing on the second node. 

25 5. The computer accessible medium as recited in claim 1 wherein the request to load the 
copy is issued by an agent that monitors the application on behalf of a cluster server. 

6. The computer accessible medium as recited in claim 1 wherein the instructions, when 
executed, retrieve the copy from one of the at least one other node responsive to the 
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request. 



7. The computer accessible medium as recited in claim 1 wherein the at least one other 
node is configured to load the copy into a global storage, and wherein the instructions, 

5 when executed, retrieve the copy from the global storage responsive to the request. 

8. The computer accessible medium as recited in claim 1 wherein the instructions, when 
executed, replicate the checkpoint segment by performing at least: 

10 reading the checkpoint segment from the first local storage of the first node; and 

transmitting the checkpoint segment to the at least one other node. 

9. The computer accessible medium as recited in claim 1 wherein the instructions, when 
15 executed: 

receive a retrieval request in the at least one other node; and 
transmit the copy to a source of the retrieval request. 

20 

10. The computer accessible medium as recited in claim 1 wherein the instructions, when 
executed: 

receive the checkpoint segment from the first node in the at least one other node; 
25 and 

store the checkpoint segment in the at least one other node. 

11. The computer accessible medium as recited in claim 1 wherein the at least one other 
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node is configured to load the copy into a global storage, and wherein the instructions, 
when executed: 

receive the checkpoint segment from the first node; and 

store the checkpoint segment in the global storage. 

12. The computer accessible medium as recited in claim 1 wherein the at least one other 
node is not capable of executing the application. 

13. A method comprising: 

replicating a checkpoint segment from a first local storage of a first node to at 
least one other node, wherein the checkpoint segment is stored into the 
first local storage by an application, and wherein the checkpoint segment 
comprises a state of the application; and 

loading a copy of the checkpoint segment from the at least one other node into a 
first local storage of a second node responsive to a request from the second 
node to load the copy, wherein the second node is to execute the 
application. 

14. The method as recited in claim 13 wherein the at least one other node comprises two 
or more other nodes. 

15. The method as recited in claim 13 further comprising issuing the request to load the 
copy by the application executing on the second node. 

16. The method as recited in claim 13 further comprising issuing the request to load the 



copy by an agent that monitors the application on behalf of a cluster server. 

17. The method as recited in claim 13 further comprising retrieving the copy from one of 
the at least one other node responsive to the request. 
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18. The method as recited in claim 13 wherein the at least one other node is configured 
to load the copy into a global storage, the method further comprising retrieving the copy 
from the global storage responsive to the request. 

10 19. The method as recited in claim 13 wherein replicating the checkpoint segment 
comprises: 

reading the checkpoint segment from the first local storage of the first node; and 
15 transmitting the checkpoint segment to the at least one other node. 

20. The method as recited in claim 13 further comprising: 

receiving a retrieval request in the at least one other node; and 

20 

transmitting the copy to a source of the retrieval request. 

21. The method as recited in claim 13 further comprising: 

25 receiving the checkpoint segment from the first node in the at least one other 

node; and 

storing the checkpoint segment in the at least one other node. 
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22. The method as recited in claim 13 further comprising: 

receiving the checkpoint segment from the first node; and 
storing the checkpoint segment in a global storage. 

23. The method as recited in claim 13 wherein the at least one other node is not capable 
of executing the application. 

24. A cluster comprising a plurality of nodes, wherein a first node of the plurality of 
nodes is configured to replicate a checkpoint segment from a first local storage of the first 
node to at least one other node of the plurality of nodes, wherein the checkpoint segment 
is stored into the first local storage by an application executing on the first node, and 
wherein the checkpoint segment comprises a state of the application, and wherein a 
second node of the plurality of nodes is configured to load a copy of the checkpoint 
segment from the at least one other node into a second local storage of the second node 
responsive to a request to load the copy, wherein the second node is to execute the 
application. 

25. The cluster as recited in claim 24 wherein the at least one other node is not capable of 
executing the application. 
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